22 research outputs found
An efficiency curve for evaluating imbalanced classifiers considering intrinsic data characteristics: Experimental analysis
Balancing the accuracy rates of the majority and minority classes is challenging in imbalanced
classification. Furthermore, data characteristics have a significant impact on the performance
of imbalanced classifiers, which are generally neglected by existing evaluation
methods. The objective of this study is to introduce a new criterion to comprehensively
evaluate imbalanced classifiers. Specifically, we introduce an efficiency curve that is established
using data envelopment analysis without explicit inputs (DEA-WEI), to determine
the trade-off between the benefits of improved minority class accuracy and the cost of
reduced majority class accuracy. In sequence, we analyze the impact of the imbalanced
ratio and typical imbalanced data characteristics on the efficiency of the classifiers.
Empirical analyses using 68 imbalanced data reveal that traditional classifiers such as
C4.5 and the k-nearest neighbor are more effective on disjunct data, whereas ensemble
and undersampling techniques are more effective for overlapping and noisy data. The efficiency
of cost-sensitive classifiers decreases dramatically when the imbalanced ratio
increases. Finally, we investigate the reasons for the different efficiencies of classifiers on
imbalanced data and recommend steps to select appropriate classifiers for imbalanced data
based on data characteristics.National Natural Science Foundation of China (NSFC) 71874023
71725001
71771037
7197104
How electronic word of mouth dynamically influences product sales and supplies: an evidence from China film industry
As an important part of e-commerce, online reviews play a significant role in consumers’ purchase decisions. This study investigated the dynamic effects of electronic word of mouth (eWOM)
and the number of people wishing to watch a movie on movie
sales and supplies in the Chinese movie market. Using a dynamic
simultaneous equation system and data of 76 films, this study
analyzed the interrelationships between eWOM and movie sales
and supplies. Our findings showed that both the volume and
valence of eWOM affected movie sales and supplies significantly.
The number of people who wanted to watch a movie had an
opposite effect on movie sales and supply; eWOM volume had a
positive impact on movie sales and supplies; and eWOM valence
had a negative impact on movie sales and a positive impact on
movie supplies. The number of people who wish to watch a
movie was another important variable for movie sales and supplies, and it had a negative impact on the daily movie sales but a
positive impact on the daily movie supplies. This study provided
a detailed explanation of these results and thus contributed to
improving the efficiency of movie suppliers’ utilization of
online reviews
A Similarity Measure-based Optimization Model for Group Decision Making with Multiplicative and Fuzzy Preference Relations
Group decision making (GDM) problem based on different preference relations aims to obtain a collective opinion based on various preference structures provided by a group of decision makers (DMs) or experts, those who have varying backgrounds and interests in real world. The decision process in proposed question includes three steps: integrating varying preference structures, reaching consensus opinion, selecting the best alternative. Two major approaches: preference transformation and optimization methods have been developed to deal with the issue in first step. However, the transformation processes causes information lose and existing optimization methods are so computationally complex that it is not easy to be used by management practice. This study proposes a new consistency-based method to integrate multiplicative and fuzzy preference relations, which is based on a cosine similarity measure to derive a collective priority vector. The basic idea is that a collective priority vector should be as similar per column as possible to a pairwise comparative matrix (PCM) in order to assure the group preference has highest consistency for each decision makers. The model is computationally simple, because it can be solved using a Lagrangian approach and obtain a collective priority vector following four simple steps. The proposed method can further used to derive priority vector of fuzzy AHP. Using three illustrative examples, the effectiveness and simpleness of the proposed model is demonstrated by comparison with other methods. The results show that the proposed model achieves the largest cosine values in all three examples, indicating the solution is the nearest theoretical perfectly consistent opinion for each decision makers
Machine learning methods for systemic risk analysis in financial sectors.
Financial systemic risk is an important issue in economics and financial systems. Trying
to detect and respond to systemic risk with growing amounts of data produced in financial markets
and systems, a lot of researchers have increasingly employed machine learning methods. Machine
learning methods study the mechanisms of outbreak and contagion of systemic risk in the financial
network and improve the current regulation of the financial market and industry. In this paper, we
survey existing researches and methodologies on assessment and measurement of financial systemic
risk combined with machine learning technologies, including big data analysis, network analysis
and sentiment analysis, etc. In addition, we identify future challenges, and suggest further research
topics. The main purpose of this paper is to introduce current researches on financial systemic risk
with machine learning methods and to propose directions for future work.This research has been partially supported by grants from the National Natural Science Foundation
of China (#U1811462, #71874023, #71771037, #71725001, and #71433001)
Behavior monitoring methods for trade-based money laundering integrating macro and micro prudential regulation: a case from China
Trade-based Money Laundering, a new form of money laundering using international trade as a signboard, always appears along with speculative capital movement which has been accepted as the most concerned and consensus incentive giving rise to the collapse of the financial market. Unfortunately, preventing money laundering is very difficult since money laundering always has a plausible trade characterization. To reach this goal, supervision for regulator and financial institutions aims to effectively monitor micro entities’ behavior in financial markets. The main purpose of this paper is to establish a monitoring method including accurate recognition and classified supervision for Trade-based Money Laundering by means of knowledge-driven multi-class classification algorithms associated with macro and micro prudential regulation, such that the model can forecast the predicted class from the concerned management areas. Based on empirical data from China, we demonstrate the application and explain how the monitor method can help to improve management efficiency in the financial market.
First published online 8 May 201
Machine learning methods for systemic risk analysis in financial sectors
Financial systemic risk is an important issue in economics and financial systems. Trying to detect and respond to systemic risk with growing amounts of data produced in financial markets and systems, a lot of researchers have increasingly employed machine learning methods. Machine learning methods study the mechanisms of outbreak and contagion of systemic risk in the financial network and improve the current regulation of the financial market and industry. In this paper, we survey existing researches and methodologies on assessment and measurement of financial systemic risk combined with machine learning technologies, including big data analysis, network analysis and sentiment analysis, etc. In addition, we identify future challenges, and suggest further research topics. The main purpose of this paper is to introduce current researches on financial systemic risk with machine learning methods and to propose directions for future work.
First published online 6 May 201
An efficient consensus reaching framework for large-scale social network group decision making and its application in urban resettlement
The authors thank the editor and the anonymous referees for their valuable comments and insightful recommendations. This work was supported in part by grants from the National Natural Science Foundation of China (#71874023, #71771037, #71971042, #71910107002, and #71725001) and supported by the Spanish State Research Agency under Project PID2019-103880RB-I00/AEI/10.13039/501100011033.Urban resettlement projects involve a large number of stakeholders and impose tremendous
cost. Developing resettlement plans and reaching an agreement amongst stakeholders
about resettlement plans at a reasonable cost are some of the key issues in urban
resettlement. From this perspective, urban resettlement is a typical large-scale group
decision-making (GDM) problem, which is challenging because of the scale of participants
and the requirement of high consensus levels. Observing that residents who are affected by
a resettlement project often have tight social connections, this study proposes a framework
to improve the consensus reaching and uses the minimum consensus cost to reduce the
total cost for urban resettlement projects with more than 1000 participants. Firstly, we
construct a network topology that consists of two layers to deal with incomplete social
relationships amongst large-scale participants. An inner layer consists of participants
whose preference similarities and trust relations are known. Meanwhile, an outside layer
includes participants whose trust relations cannot be determined. Secondly, we develop
a classification method to classify participants into small subgroups based on their preference
similarities. We can then connect the participants whose trust relations are unknown
(the outside layer) with the ones in the inner layer using the classification results. To facilitate
effective consensus reaching in large-scale social network GDM, we develop a threestep
approach to reconcile conflicting preferences and accelerate the consensus process at
the minimum cost. A real-life urban resettlement example is used to validate the proposed
approach. Results show that the proposed approach can reduce the total consensus cost
compared with the other two practices used in the actual urban resettlement operations.National Natural Science Foundation of China (NSFC) 71874023
71771037
71971042
71910107002
71725001Spanish State Research Agency PID2019-103880RB-I00/AEI/10.13039/50110001103
Large-scale group decision-making with non-cooperative behaviors and heterogeneous preferences: An application in financial inclusion
The authors thank the editor and anonymous referees for their valuable comments and insightful recommendations, and thank Prof. Yucheng Dong for the helpful suggestions. This research was supported in part by grants from the National Natural Science Foundation of China (#71874023, #U1811462, #71725001, #71771037, #71971042, #71910107002) and Major project of the National Social Science Foundation of China (#19ZDA092). Enrique Herrera-Viedma is supported by the FEDER funds in the project TIN2016-75850-R.Non-cooperative behavior is a common situation in large-scale group decision-making (LSGDM) problems. In addition, decision makers in LSGDM often use different preference formats to express their opinions, due to their educational backgrounds, knowledge, and experiences. Heterogeneous preference information and non-cooperative behaviors bring challenges to LSGDM. This study develops a consensus reaching model to address heterogeneous LSGDM with non-cooperative behaviors and discuss its application in financial inclusion. Specifically, the cosine similarity degree is introduced to build a distance measure for different preference structures. Clustering analysis is employed to divide large-scale groups and handle non-cooperative behaviors in LSGDM. A consensus degree and a weighting process are proposed to decrease the influence of non-cooperative behaviors and facilitate the consensus reaching process. The convergence of the proposed approach is proven by theoretical and simulation analyses. Experimental studies are carried out to compare the performances of the proposed approach with existing methods. Finally, a real-life example from the “targeted poverty reduction project” in China is presented to validate the proposed approach. The selection of beneficiaries in finance inclusion is difficult due to the lack of credit history, the large number of participants, and the conflicting views of participants. The results showed that the proposed consensus model can integrate opinions of participants using diverse preference formats and reach an agreement efficiently.National Natural Science Foundation of China (NSFC)
71874023
U1811462
71725001
71771037
71971042
71910107002Major project of the National Social Science Foundation of China
19ZDA092European Union (EU)
TIN2016-75850-
A geometrical method for consensus building in GDM with incomplete heterogeneous preference information
This work was supported in part by grants from the National Natural Science Foundation of China (#71874023, #71725001, #71910107002, #71771037, #71971042), and European Regional Development Fund (FEDER), European Union funds in the project TIN2016-75850-R.In real-life group decision-making (GDM) problems, the preferences given by decision-makers(DMs)
are often incomplete, because the complexity of decision-making problems and the limitation of
knowledge of DM make it difficult for DMs to take a determined evaluation of alternatives. In
addition, preference relations provided by DMs are often heterogeneous because they always have
different decision habits and hobbies. However, the consensus method for GDM under incomplete
heterogeneous preference relations is rarely studied. For four common preference relations: utility
values, preference orderings, and (incomplete) multiplicative preference relations and (incomplete)
fuzzy preference relations, this paper proposes a geometrical method for consensus building in GDM.
Specifically, we integrate incomplete heterogeneous preference structures using a similarity-based
optimization model and set a corresponding geometrical consensus measurement. Then, preference
modification and weighting processes are proposed to improve consensus degree. Finally, we conduct
a comparison analysis based on a qualitative analysis and algorithm complexity analysis of existing
consensus reaching methods. Numerical analyses and convergence tests show that our method can
promote the improvement of the consensus degree in GDM, and has less time complexity than the
previous methods. The proposed geometrical method is a more explainable model due to operability
and simplicity.National Natural Science Foundation of China (NSFC)
71874023
71725001
71910107002
71771037
71971042European Regional Development Fund (FEDER), European Union
TIN2016-75850-